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. I will speak about a number of open problems in queueing. Some of them are 

I known for decades, some are more recent. They relate to stability and to rare events. 

There is an idea to prepare a special issue of QUESTA on open problems, and 
this text may be considered as a prospective contribution to that. The choice of open 



o 

(N 



Qj I problems reflects the speaker's own interests, and should not be taken as suggesting 

^0 ' that these are the only, or even most important, problems! 

p^i 1 Mult i- server queue with First-Come-First-Served discipline 
Oh' 

(-H I A system with a finite number of identical servers and with FCFS service disciphne is one 

I of the simplest models in queueing theory. It has been known for a long time. To the best 

of my knowledge, Kiefer and Wolfowitz were the first who studied it rigorously. 

1.1 Convergence in the total variation to the stationarity 

\^ I Consider first a single-server first-come-first-served (FCFS) queue G/G/1 with interarrival 

^ I times {tn} between the arrivals of the n^^ and (n + 1)** customers, and service times of 

^ the n^^ customer {(T„}. Assume that the two-dimensional sequence {tn,crn} is stationary 

■ ergodic and that the queue is stable, that is EcJi < Eti. 
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Let Wn, n > 1 be the waiting time of customer n in the system (before the start of its 
service). Then 



X; M^i=x>0, and W^+i = max(0, VF„ + fT„ - t„) = (W„ + Cn)+, n>l 



where x is the initial delay, = — tn, and = max(0, x). 
Let = and 5„ = ■^j , ?t- > 1 . Clearly, 

Wn = max(0, X + Sn, Sn - Si, Sn - S2, . . . ,Sn - Sn-l) 

which implies that there exists a proper limiting distribution which does not depend on x. 
In other words, there exists a unique stationary distribution of the waiting time and, for 
any initial delay, there is a convergence to stationarity, and the convergence is in the total 
variation norm. 

There are many known ways to establish this result. The best one seems to use the "Loynes 
scheme". Without loss of generality, we may assume {an,tn) to be defined for all —00 < 
n < 00. Let 5o = and Sn = Sj=i ^-j, n>l. Then 

Wn =st M^^') := max(0, X + Sn, Sn-l, ...,Si). (1) 
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Denote M = sup„>o5'„. Prom the SLLN, 5„ —oo a.s., so M < oo a.s. Moreover, the 
time 

u = u^'^'^ = max{n > : x + S'n, > 0} 
is finite a.s. and, therefore, 

mI^'i=M for all n>v^''\ 

So, M is the unique limiting distribution and the convergence in total variation follows 
from the coupling inequality: for any 2; > 0, 

sup \P{Wn G ^)-P(M eA)\ = sup \P{M^J^^ G A)-P(M G A)\ < P(zy(^') > n) ^ 0, n ^ 00. 

In particular, if a; = 0, then - as follows from ([T]) - the sequence M„ := Mn \ n > 1 is 
monotone increasing a.s. and couples with M, starting from time u^^^ + 1. 

Now let m > 1 be a positive integer and consider the G/G/m FCFS queue with interarrival 
times {tn} and service times {cr„}. Assume again that the two-dimensional sequence 
{tnjCTn} is stationary ergodic and that the system is stable. Here the stability means 
E(Ji < mEti. 

Consider Kiefer-Wolfowitz vectors of virtual waiting times W„ = {Wni, ■ ■ ■ , Wnm) which 
satisfy the recursion 

Wi = X > and W„+i = i?(W„ + eicj„ - n > 1 

where x is a vector of initial delays, ei = (1,0, ... ,0) is a unit vector, 1 = (1, 1, . . . , 1) 
is a vector of units, and operator R rearranges coordinates of a vector in weak ascending 
order. In particular, Wn^i is a waiting time of customer n. 

It is known that, in general, in a multi-server queue there may be many stationary regimes 
[2Q| : there exist the minimal and the maximal stationary distribution [21 E], and there are 
some relations between stationary distributions [7]. Similarly to the one-dimensional case, 
one can define vectors M^f'' which satisfy a recursion which is more complex than ([1]). In 
particular, if there are no initial delays, x = 0, then the vectors M„ = M^^'* are monotone 
weakly ascending (coordinate-wise and a.s.) and converge a.s. to a limiting random vector 
which has the minimal stationary distribution TTmin (see e.g. [2S])- But this implies only 
the weak convergence of the distributions of these vectors, and not the convergence in the 
total variation. 

If the sequence {{crn,tn)} satisfies in addition some good "mixing" properties (say, is i.i.d. 
or regenerative), then one can again show the uniqueness of the stationary regime and 
the convergence in the total variation starting from each initial value, using either Harris 
properties (in the Markovian case, see e.g. [21]) or the renovation techniques (in a more 
general setting, see e.g. [1]). 

The conjecture is: assuming only that the sequence {{(Tn,tn)} is stationary ergodic, that 
the stability condition holds, and that the initial value is 0, then there is no need for any 
further restriction to establish the convergence in the total variation: 

sup |P(W„ eA)- tt{A) 1^0, n^oo. 
A 

There have been a number of unsuccessful attempts to prove this conjecture (see, e.g., 
[23]). 
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1.2 Existence of Moments 



Assume now that {(T„} and are two i.i.d. sequences that do not depend on each other. 
Continue to assume the stabihty condition p := EcJi/Eti < m holds. Recall that in this 
case the stationary distribution is unique. 

Denote by D the stationary waiting time in the multi-server queue. Fix 7 > and formulate 
the following question: what are the conditions for ED'^ to be finite. 

A correct (but partial!) answer to this question has been obtained recently by Scheller- 
Wolf and Vesilo [26]. To be completely exact, the result below was not formulated by these 
authors but may be deduced from their results. 

Denote by Bj the integrated service time distribution, 

Bj{x) = 1 - min (^1, j P(f7i > . 

Let <T/^i, . . . , (T/,m be i.i.d. random variables with common distribution Bi. 

Proposition Assume that p is not an integer and denote by A; G {0, 1, . . . , m — 1} its 
integer part. Then 

ED'^ < cx) iff E (min((T/^i, . . . , (J/^m-A;))''' < 00. 

The proofs of the results in [2S] are based on the construction of an auxiliary, so-called 
"semi-cyclic" service discipline. A direct proof of the proposition may be found in [13] and 
is based on ideas close to ideas of Keifer and Wolfowitz |16| . 

An open problem here is: what are the conditions for existence (finiteness) of power 
moments of D if p is an integer. 



1.3 Rare events 

Assume again that {o"„} and are two i.i.d. sequences that do not depend on each 
other. Continue to assume the stability condition p := EcJi/Eti < m to hold. 

Again let D be the stationary waiting time. We formulate the following questions: what 
may the asymptotics for P{D > x) be when x is large and what is the "typical" sample path 
which lead to such a large value of the stationary waiting time. To answer (only partially!) 
these questions, we need further restrictions on the distribution of service times. 

First, we assume that the common distribution of service times is heavy-tailed, i.e. for any 
c > 0, its cth exponential moment does not exist, Ee'^'^^ = 00. Whitt [27] formulated the 
following conjecture: ifk<p<k + l for an integer k < then 

P{D > x) J (Bj{r]x))"^~^ as x ^ 00 (2) 

"where 7 and r] are positive constants (as functions of x)" [sic, [27]] and where Bj is the 
tail of the integrated service time distribution. Intuitively, formula jj]) says that the main 
cause for the stationary waiting time to be large is to have m — k big service times in the 
past. 

In [12j and [13| we show that if the distribution of service times is intermediate regularly 
varying and if p is not an integer, then the conjecture of Whitt is correct with 7 = 7(3;) 
squeezed between two positive constants and with tj being a constant. Also, the conjecture 
holds if p £ (0, 1) and if the distribution Bj is any subexponential distribution. Also, we 
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found that if m = 2, p < 2, p 7^ 1, and the service times distribution is again intermediately 
regularly varying, then 7 is a constant. 

The open problems here are (I formulate them in the particular case of a two server 
queue, m = 2): find the asymptotics for P{D > x) 

(i) if p = 1 - at least, for some particular subexponential (say, regularly varying) distribution 
of service times; 

(ii) if p € (1,2) and if the distribution of service times is heavy-tailed but has all power 
moments finite, for example, if P(fTi > x) = e"^'', for some (3 G (0, 1). 

It would be great to understand what are in these cases the "typical" paths which lead to 
large values of D. 

2 Further problems on multi-server queues 

Consider again the multi-server (say, 2-server) queue with stationary and ergodic input 
{tn, Cn}, but assume now that the discipline is "join-the-shortest-queue": there are individual 
queues in front of the servers, and each arriving customer joins immediately the shortest 
queue (or one of the shortest at random if there are many). 
So, here are the open problems: 

- how many stationary regimes may exist if we do not assume any extra condition in 
addition to the obvious stability condition? 

- what are the minimal requirements for the uniqueness of the stationary regime? 

- under what conditions (none?) do we have weak (or TV) convergence? 

The model exhibits NO monotonicity. It is not amenable to Loynes-type schemes. It is 
entirely open ([18j). 

3 Greedy service mechanism 

There are many circumstances in our life where we may ask the question, is a "locally 
optimal" ("greedy") mechanisms also optimal in the long-run? Below are examples of 
mathematical models where the answer to such a question is open. 

There are two continuous state space models where the stability conjecture is obvious, 
but nobody is able to verify it. In both models, the driving algorithm contains a "locally 
optimal" ("greedy") element. It looks like none of the existing stability methods works here. 

3.1 Stability of a greedy server 

A single server is located on the circle. Particles arrive in a Poisson stream of rate A and are 
uniformly distributed (as material points) on the circle (people say that there is a "Poisson 
rain" of particles). It takes a single unit of time to serve a particle. After any service, the 
particle disappears, the server chooses to serve next the closest particle and moves to it 
with a (positive finite) constant speed (ignoring new arrivals), serves it during another unit 
of time, then chooses the next closest particle and moves to it, etc. 

The conjecture is: this model is stable for any A < 1. A plausible "proof" might be 
as follows: if the number of requests is very large, then the server is busy with service 
almost all the time (with a service speed close to one), and then we may apply, say, fluid 
approximation ideas to deduce the stability. 
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This model and this conjecture have already been known for more than 20 years, see [5], 
but nobody has been able to succeed with obtaining either a proof or a counter-example 
here. The key problem is the continuity of the state space, and there are several results 
(see, e.g., [HI [IHl [2l] for further details) with the proof of a similar hypothesis for models 
with a finite state space (for instance, you may replace the continuous circle by a finite 
lattice on it). If the server uses any "state-independent" algorithm for moving (say, always 
walks in the left direction or chooses the next direction with probability 1/2 independently 
of everything else), then it is easy to verify the conjecture using the ideas explained above 
- see, e.g., [HI [T^. 

3.2 Stability of a model with two streams: stream of customers and 
stream of servers 

Again, there is a circle, but this time no server or service. Instead, there are two independent 
Poisson streams/rains, of "black" and of "white" particles, with rates A and 1, respectively. 
Black particles arrive at the circle and stop there, but white particles pass straight through 
the circle (this means they "arrive and immediately disappear"). There is given a distance 
e > 0. When a white particle passes through the circle at some point, it observes all blacks 
in the e-neighbourhood and takes (deletes) the one which is the closest to itself (if there 
are any black particles at that instant). 

The natural conjecture is: stability should be guaranteed by the condition A < 1, independently 
of the circle length and the number e. But the problem is open too. Again, there exist simple 
proofs for stability if the model is modified: if either the continuous state space (the circle) 
is replaces by a finite set, or the greedy mechanism is replaced by any state-independent 
mechanism (for instance, if a white particle takes one of blacks from the neighbourhood 
"at random", with equal probabilities) (see [1]). 

4 Stability may depend on the whole distribution 

The conjecture is: even in simple queueing systems, the stability conditions may depend 
both on the initial values and on the whole distribution of the driving sequences. 

Here is an example where the conjecture may be true (see [8] for more detail). 

Consider a system with three servers (numbered 1 to 3) fed by a Poisson process with 
intensity A. There are three classes of customers and each arriving customer becomes a 
class i customer (i = 1,2,3) with probability 1/3. Class 1 customers may be served by 1st 
and 2nd servers (where 1st server is "left" and 2nd server is "right"), class 2 customers by 
2nd and 3rd servers (here 2nd server is "left" and 3rd is "right"), and class 3 customers by 
3d and 1st servers (here 3rd is "left" and 1st is "right"). Upon arrival, a customer chooses 
an accessible server with the shorter workload. There are two probability distributions, Fi 
and Fr, and a customer's service time has distribution Fi if it is served by its left server 
and Fr otherwise. 

Simulations show that the conjecture may hold, but there is no a rigorous proof. 
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5 Random fluid limits and positive Lebesgue measure of the 
area of nufl-recurrence 

Consider an open polling system with two stations and two "heterogeneous" servers. Each 
station i = 1,2 has a Poisson input with intensity Aj = 1. For i,j S {1,2}, service times 
of server j at station i are i.i.d. exponential with intensity n[ . Both servers follow the 
so-called exhaustive service policy: after completing a service, a server either starts with 
a service of a new customer (if there is any), or leaves the station for the other one. We 
assume for simplicity that "walking" ("switchover") times are equal to zero. If there is no 
free customer at either station, the server becomes "passive". 

The system described has a nice fluid model where all fluid limits are random and piecewise 
deterministic. The system is characterised by 4 parameters {^J.[''^}iJ=l,2, and all of them 
have to be less than one in order to make the system stable (but this is definitely not 
sufficient for stability). 

The conjecture here is: in the positive 4-dimensional cube, the set of parameters {/xp^} 
for which the system is "null-recurrent" has a positive Lebesgue measure. See [H] for more 
details. 

6 Mult i- access channel with protocols based on partial information 

Consider a single channel which is shared among many users and transmits packets (messages) 
of a single length. Assume that time is slotted and each service time is equal to the slot 
length. The number of packages arriving into the system during a time slot is Poisson with 
parameter A. At the beginning of time slot n, each package is trying to be transmitted 
with the same probability Pn independently of everything else. If two or more packages 
try to transmit simultaneously, the transmissions collide and packages stay in the system 
and have to try again later. If there is only one transmission, then it is successful and the 
package leaves the system. If there are no transmissions, then the slot is empty. Denote 
by Wn the number of packages in the system at the beginning of nth time slot. Assume 
that probabilities pn are defined inductively and that Pn+i may depend only on pn and the 
"binary" information of whether there is a successful transmission in slot n or not. Then 
the pairs {Wn,Pn) form a time homogeneous Markov chain. 

The open question is: in the class of procedures (protocols) described above, does there 
exist a protocol which makes the Markov chain {Wn,Pn) positive recurrent (ergodic), for 
some positive A? See [l5] for more detail. 

It is known [T71I22] that if a choice ofpn+i is based on p„ and of either of two other "binary" 
informations (nth slot was empty or not; or there was collision in nth slot or not), then 
A < is necessary and sufficient for the existence of an ergodic protocol. 
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